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Abstract 

Information about primary transmitter location is crucial in enabling several key capabilities in 
cognitive radio networks, including improved spatio-temporal sensing, intelligent location-aware routing, 
as well as aiding spectrum policy enforcement. Compared to other proposed non-interactive localization 
algorithms, the weighted centroid localization (WCL) scheme uses only the received signal strength 
information, which makes it simple to implement and robust to variations in the propagation environment. 
In this paper we present the first theoretical framework for WCL performance analysis in terms of 
its localization error distribution parameterized by node density, node placement, shadowing variance, 
correlation distance and inaccuracy of sensor node positioning. Using this analysis, we quantify the 
robustness of WCL to various physical conditions and provide design guidelines, such as node placement 
and spacing, for the practical deployment of WCL. We also propose a power-efficient method for 
implementing WCL through a distributed cluster-based algorithm, that achieves comparable accuracy 
with its centralized counterpart. 

I. Introduction 

Cognitive Radio (CR) is a promising approach to efficiently utilize the scarce RF spectrum resource. In 
this paradigm, knowledge about spectrum occupancy in time, frequency, and space that is both accurate 
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and timely is crucial in allowing CR networks to opportunistically use the spectrum, and avoid interference 
to a primary user (PU). In particular, information about PU location will enable several key capabilities 
in CR networks including improved spatio-temporal sensing, intelligent location-aware routing, as well 
as aiding spectrum policy enforcement. 

The localization problem in CR networks is in general different from localization in other applications 
such as Wireless Sensor Networks (WSN) and Global Positioning System (GPS), in which the target to be 
localized cooperates with the localization devices. In contrast, a PU does not communicate directly with 
the CRs during the localization process. This scenario is referred to as non-cooperative localization and 
does not allow the use of conventional techniques such as time-of-arrival (TOA) and time-delay-of-arrival 
(TDOA) ranging. 

A. Related Work 

Several techniques have been proposed in the literature for estimating the position of active PUs in 
a CR network. These schemes include both range-based J21, Q and range-free Bl- lTTOl algorithms. 
Range-free positioning schemes, such as centroid localization |5 ], have attracted a lot of interest because 
of their simplicity and robustness to changes in wireless propagation properties such as path loss. This 
characteristic makes them suitable candidates for systems requiring coarse-grained, but reliable and cost- 
effective techniques. On the other hand, range-based techniques such as multi-lateration |2), offer better 
estimates, but require accurate information about the target's distance that can only be achieved either 
through cooperation with the target or a very precise knowledge of the path loss model. 

In this paper, we focus on analyzing the performance of a particular, low complexity, coarse-grained 
localization algorithm, referred to as Weighted Centroid Localization (WCL) iTTTTl . In this technique, PU 
location is approximated as the weighted average of all secondary user positions within its transmission 
range. Strictly speaking, WCL is not an entirely range-free technique because it requires additional 
information aside from simple connectivity, namely Received Signal Strength (RSS) measurements. WCL 
and WCL-like methods are suitable for PU localization because of their low computational complexity. 
They also do not require cooperation from the PU and only use the readily available RSS information. 
In addition, WCL always converges to a solution, as opposed to some range-based techniques, and do 
not require estimation of the path loss exponent beforehand. 

Recent papers on WCL have proposed improvements on the basic algorithm to enhance its performance 
for specific scenarios [ 12 ]— [ 15 ]. All previous WCL studies reported in the literature have been analyzed 
either via simulation or field experiments ifTTI . fTo*! . To the best of our knowledge, there has been no 
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analysis of the probability distribution of the localization error for even the simplest version of WCL in 
the presence of shadowing in both WSN and CR literature. 

B. Contributions 

We present the first theoretical analysis of the error distribution of WCL in this paper. The presented 
analytical framework models varying levels of shadowing, including both independent and correlated 
shadowing environments, and also takes inaccuracy of sensor node positioning into consideration. Using 
this analysis in conjunction with numerical simulations we observe and quantify the robustness of WCL 
to variable channel conditions and node placements. We also investigate variable node participation as a 
simple modification that can improve WCL performance. 

In the latter part of this paper we propose and investigate a distributed cluster-based implementation 
of WCL in the context of CR networks. Previous work primarily assumes a centralized fusion node 
aggregating data from other nodes. In the context of WSN where WCL was first utilized ifTTTl . the 
primary motivation was node self-localization where a node applies WCL using measurements from 
beacon nodes in close proximity to it. This approach naturally dictates a centralized system where the 
node to be localized acts as the fusion center and only requires minimal interaction with the beacons. 
In the case of PU localization in CR networks, however, where the primary motivation is estimating the 
position of a non-cooperative node, there no longer exists any particular node that has all the requisite 
information to perform WCL. In this case a fusion node is arbitrarily chosen. This approach does not 
scale very well in terms of communication overhead and transmit power, which limits its application to 
low node densities or small network size. 

There is therefore a need to create distributed implementations of WCL for this technique to be 
practically applied in CR localization. A standard approach for distributing control in sensor networks is 
clustering. Several algorithms exist in the literature for creating either disjoint ifTTl . ifTHl or overlapping 
|[T9l clusters. In [20], a distributed technique based on the concept of mean-shift II2T1 was proposed. 
However, the technique requires repeatedly forming a virtual cluster until it converges to the target 
location. This introduces additional communication overhead and also requires several iterations to 
converge. We present an implementation which saves total transmit power needed for localization process 
communication, and investigate its accuracy as well as its effect on per node computational complexity 
and total transmit power consumption. 

The rest of the paper is organized as follows. The system model is introduced in Section ITT] The 
theoretical analysis of centralized WCL accuracy is presented in Section [Till while a practical and 
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distributed implementation of WCL is proposed and analyzed in Section [IVJ Numerical results for various 
scenarios used to evaluate performance of WCL algorithms are presented in Section [V] Finally, the paper 
is concluded in Section [VT] 



This section presents the general framework for our analysis. We first review the assumptions about 
the system setting and propagation model, and then present the basic WCL algorithm. 

We adopt the PU-centric model |[T3l in which the PU transmitter is assumed to be at the center area 
of a circle. The rationale behind this choice is that in a practical scenario, sensor nodes that obtain 
enough received power (i.e. above sensitivity threshold) approximately form a circle around the primary 
transmitter. Thus, sensors within this circle are used to jointly estimate the position of the PU. An 
implicit assumption in this model is that the PU is surrounded by enough nodes so that edge effects can 
be disregarded. 

Assume N sensor nodes in a circle of fixed radius R. The number of sensors within the circle is 
jointly affected by several factors, such as transmit power of the PU (which effectively determines R), 
node density, node selection methodology, etc. N is a design parameter for WCL deployment, the choice 
of which will have significant influence on the localization accuracy, as will be shown in our analysis. 

The PU located at the center has a coordinate defined as L p = [x p , y p 

] T = [0,0] T . For the ith sensor 

node, we define its 2-dimensional location as Lj = [xi,yj] T where xf + y f < R 2 . Sensors are placed 
independently in both dimensions. We assume sensors obtain information about their own locations, which 
is modeled as imperfect. The measured location of the ith sensor is distributed as Lj ~ M (Li, af I2), 
where 07 defines standard deviation of the error in sensor node self-localization. We further assume the 
self-localization errors are independent among sensors and are independent from received powers. 

We adopt the following channel model for received signal strength used in WCL. The received power 
of the ith node from the primary transmitter, Pi, is given by 



in which the first two terms characterize path loss and the last term describes shadowing effect. The path 
loss is characterized by three constants: reference power Pq, reference distance d$ and path loss exponent 
7. Denote collection of shadowing variables as s = [si, S2, ■ ■ ■ , sjy], then its distribution is characterized 
by s ~ N(0, fi s ). We consider the following shadowing cases 



II. System Model 




(1) 




(2b) 



(2a) 
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where X c is the correlation distance E2l within which the shadowing effects of nodes are correlated, 
and In is the identity matrix with dimension N. 

The theoretical analysis presented in Section HIT] is based on the centralized architecture, which will 
also give us an upper bound on the performance. In each localization period, all sensors involved in 
the WCL will send their RSS measurements to the fusion center, where an estimate will be made about 
the coordinate of the primary transmitter. The considered algorithm is a modification of relative-span 
WCL proposed in |[T2l . In this scheme, the weights are guaranteed to be non-negative in dB scale. The 
2-dimensional estimated location of PU is formed as 

£ _ Sj=l w i^i _ Ya=1 {Pi ~ -Pmin) \h ^ 
Sj=l w i ^2i=l\Pi ~ Prom) 

The weight factor of node i is defined as Wi = (Pi — P m - m )/ AP, i = 1, 2, . . . , N, where P mm is a constant 
calculated from average received power of the node on the border with some margin, to guarantee the 
probability that receive power of all nodes in the area below P mm is sufficiently small, say 1%. P max 
is the maximum received power among nodes, and AP = P max — P m m is the span of received power. 
The localization error is then given by = L p — L p = L p = [x p ,y p ] T , where x p and y p are the 
1 -dimensional errors along the x and y-axis respectively. Finally, the performance of the WCL scheme 
is evaluated by distance error, which is given by 



eL = Jxl + yl = \\e L \\ 2 . (4) 



III. Theoretical Analysis of WCL Error Distribution 

This section presents the analysis of WCL error using probability theory, ultimately leading to analytical 
expressions for the error distribution, first of WCL applied to 1 -dimensional coordinates, and finally the 
2-dimensional WCL error given by (@]). 

A. 1 -dimensional Location Estimation Error 

1) I.I.D Shadowing: First we derive the 1-dimensional localization error for the i.i.d. shadowing case. 
We define a constant ^ for each node i as 



fli = P - 10 7 log ^ " ^ j ~ Pram, (5) 

One can verify that P$ — P m \ n = fa + Sj ~ M {fa,(xf\. The location estimation in one dimension, say, 
x-axis, is naturally given by 

a _ Si=l {Pi ~ Pmia) x i a_ a ✓gs 



ZlAPi-Prain) " & ' 
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Based on our i.i.d assumption among all Pj's, one can verify that 

For the statistical property of a, since Xj's are also i.i.d and are independent of Pj's, the mean of a is 
given by m a = YliLi fH%i- The variance of the ith term of a is given by 

Var[(Pi - Pmin)^] = E[(Pi - P min ) 2 ]E[x 2 ] - E[P< - F^Efo] 2 

= ^ 2 /i 2 + a 2 x 2 + ^- (8) 

Thus the variance of a is derived as cr 2 = Nafa 2 s + J2iLi( a ?Pi + °s^f)- From central limit theorem, 
we approximate a as a single Gaussian variable, i.e. a ~ AA(m a ,cr 2 ), which is verified by numerical 
simulations that are not presented here due to space limit. 

Since a and b are correlated due to summation over the same set of variables, i.e. Pj's, x p is a ratio 
of two correlated Gaussian variables. In order to obtain statistical properties of x p , we first calculate 
correlation coefficient p ab of a and b, which is given by 



Pab = E [(a — m a )(b — m b )] / (cr a a b ) = E 

Note that 

E [xiSiSj] = 



N N 

XiSiSj 

i=i j=i 



/(Wb). (9) 



XiCJ s , i=j, (10a) 
0, i ? j, (10b) 



thus the correlation coefficient is obtained as 



2pN - 11—11 
Pab — — ] i (11) 

where x = [xi, . . . xat] t , and ||x||j is the Zth norm of the vector. 

From the previous analysis we know that x p is the ratio of two dependent Gaussian random variables 
with non-zero means for which the closed-form pdf is available in |[23l Eqn 7.14]. Based on the pdf of 
x p , we calculate its mean m$ p and variance cr| by numerical integration. Alternatively, we also calculate 
rrix p and cr| through approximation results of |24], which assume x p follows normal distribution. The 
approximation results are given by 

~m £p ~ (m a /m b ) + a b m a /m b - p a bcr a cr b /ml 

cr l„ - <T b m l/ m b + a l/ m b ~ 2p ab a a a b m a /ml. (12) 
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The Gaussian approximation (fT2l is verified via numerical simulations. Specifically, the mean and variance 
calculated from integration of the closed-form pdf are compared with the approximated mean and variance, 
for different node numbers. Our results show that for N > 30, the approximation error for the means is 
essentially zero and less than 3% for the variances. 

Up to this stage, we are able to calculate the mean and variance of the estimation error in the x-axis 
as m ex = m^ p — x p = m^ p , = a\ . The same analysis applies for calculating the mean and variance 
of the estimation error along the y-axis, namely, m ey and a\ . Note that the estimation errors in x-axis 
and y-axis are correlated, therefore the statistical properties of the 2-dimensional error, e^, cannot be 
obtained directly. 

2) Correlated Shadowing: In this case, since Pj's in (J6]) are correlated, thus the distribution of x p 
needs to be evaluated again. Since b in © is the linear combination of correlated Gaussian variables, it 
is still Gaussian distributed (251, i-e. b ~ M (m b ,a b ). Thus, m b = Y^^i M»> an< ^ a b * s gi ven by of = 
2~2i=i Ylf=i ^ij> where Ry is the ijth element of the covariance matrix of [(Pi — P m in) ,•••■> (Pn — Pmh 
defined as 

Rij = E [(Pi - Pmin - fli) (Pj - P m in ~ fJ>j)] = E[sjSj] = O^Ay, (13) 

where X tj = e -\\^-^\\/x c _ 

For the distribution of a, it follows directly that m a = Ylu=i %ifH- lts variance is given by of = 
Yli=i Y^j=i where R£- is the ijth element of the covariance matrix of [(Pi — P m i n )xi, . . . , (Pn — 
Pmin)xN] T , which can be represented by 

Rjj — E{[(Pj Pmin)%i %iPi\[(Pj Pmin)%j ^jPjW — E[xj SiXjSj\ , 

where K[xiSiXjSj] is given by 

( a 2 s (xf + af), i = j, (15a) 

^XiXjCTsXij, i^j. (15b) 
Based on central limit theorem, we approximate a as a Gaussian random variable, i.e. a ~ AA(m a ,of ). 
The approximation is validated via simulations, and it applies for N > 20. 
The correlation coefficient p ab is given by 

N N 

Pab = ^[(a-m a )(b-m b )}/(a a a b ) = (of ^^XiA^) / (a a a b ). (16) 

i=i j=i 

Substituting the expressions for a a and a b into (fT6l ). after some simple manipulation, 

l T Ax 

Pab = j = , (17) 

' (x T Ax + Na 2 s af )1 T A1 



s 



where {A}^ = and 1 is a N x 1 vector of all l's. 

Since the location estimate x p is formed by x p = a/b, we use the closed-form pdf given by 
Eqn 7.14] to calculate m Xp and c| by numerical integration. Gaussian approximation can also be used 
similarly to i.i.d shadowing case in Sec. IIII-A1I 



B. 2 -dimensional Location Estimation Error 

In our previous analysis, we obtained the statistical properties of 1 -dimensional location estimation error 
for both i.i.d. and correlated shadowing case. From Gaussian approximation of 1 -dimensional localization 
error (fT2l ). we know that ~ A/"(et, Vt£), where ex = [m Xp ,m,y p ] T , 



n, 



a 



tip 



(18) 



and p Xp y p is the correlation coefficient between 1 -dimensional errors, the calculation of which is presented 
in the Appendix. 

The calculation of localization error involves computing 2-norm of e^. Since the two variables 
in are dependent Gaussian r.v.'s with different variances, we perform the following transformations. 
First we form two independent Gaussian variables from e^, by the standard technique of decorrelation 

iT 



ESI , e' L = QeL = [x' p ,y p ] , where Q is the orthogonal eigenvalue decomposition matrix of fi^, which 
satisfies Qf2^Q T = D, where D is a diagonal matrix. Therefore, one can verify that [25 , Eqn 2.1-159] 
e' L ~ M (Qe^, D). Then by the orthogonal-invariant property of 2-norm (25l, we know = ||ex|| 2 = 



e£Q^Qe L = ||e' 



Lll2- 



In order to obtain the pdf of ||e' L || 2 , let us first consider pdf of ||e' L || 2 = x p + y p . x p and y p are two 
chi-square variables with one degree of freedom. Then the pdf of U = He^H^ is the convolution of these 



two chi-square pdfs, denoted as fu(u)- The pdf of the localization error 



J Lll2 



V is given by 



fv{v) = 2vfu(v 2 ). As a result, the closed-form pdf of e^, for given sensor node positions, is given by 



Pe L (x) 



exp 



oo oo 



i=0 1=0 




T(i + l + 1/2 



m r af, 



H+l 



,x>0 (19) 



i\ l\ r(l/2 + \ 2m x al) \ 

where m x , m y and a x , a 2 are means and variances of x' p and y' p , respectively, and T(x) and I a (x) are 
Gamma function and ath-order modified Bessel function of the first kind, respectively. The mean and 
variance of ex, denoted as m eL and a\ , are calculated using their definitions, by numerical integration 
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over the above pdf. Note that the above calculations are based on a particular fixed node placement. If 
the nodes are randomly placed within the area according to some distribution, say uniform distribution, 
integrating 2N times over all node coordinates gives the theoretical average performance. 

IV. Distributed Implementation of WCL 

Based on the theoretical analysis, WCL requires large number of nodes in order to achieve a high level 
of accuracy. This causes large communication overhead in terms of number of messages and transmit 
power consumption when implemented in a centralized manner. We now present a practical distributed 
implementation of WCL to address this problem in the context of CR networks. 

A. Algorithm Description 

Clustering is commonly used in implementation of distributed algorithms. The challenge in designing 
these algorithms is to reduce the amount of information exchanged among clusters. The proposed 
algorithm accomplishes this by splitting the task into two phases. First, the cluster with the highest 
average RSS is selected. This cluster then acts as the coordinator for the second phase which performs 
the actual WCL algorithm. 

1) Clustering: Nodes, Nj, in the CR network are assumed to be organized into clusters, Cj, using 
the clustering algorithm presented in |fl"8ll . This particular algorithm is based on geographic boundaries 
and generates clusters which have a predefined shape (hexagonal in this case). The regularity in the 
shape of the cluster is beneficial for the first step of the distributed algorithm since WCL works best 
for a radially symmetric distribution of nodes. However, the proposed algorithm would work for any 
arbitrary clustering algorithm as long as nodes are clustered based on proximity. The PU is assumed to 
be randomly placed in the entire area and can thus be in any of these clusters. 

Each cluster selects one node, head(Cj), to be the cluster head. The cluster head maintains a table of 
all adjacent clusters, adj(Cj), and the location of its member nodes, L,(Nj),Nj £ Cj. 

Once a PU is detected by a node, using an existing PU detection method such as energy detection 
1126*11 , it transmits its RSS measurement to the corresponding cluster head which then participates in the 
distributed WCL algorithm outlined in the next section. Using a predetermined fusion rule, such as a 
majority vote, the cluster head makes a decision on the presence of the PU. All clusters that decide that 
the PU is present, participate in the Distributed WCL algorithm detailed in the next subsection. These 
clusters form a set defined as C ac tive- If in addition, some prior knowledge of the PU transmit power 
is available, a situation typical of a CR network coexisting with a licensed incumbent network, then a 
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predetermined threshold can be used to further reduce the number of clusters in C 0CfiVe . The threshold 
can be chosen based on the average node spacing to get a certain desired average number of nodes. We 
investigate the effect of this number on WCL accuracy in Sec. IV-B6I 

2) Distributed WCL (DWCL): The first phase, which selects the head cluster, is shown in Algorithm 
[TJ The objective is to select the cluster with the highest average RSS, P(Q). This is done in a distributed 
manner by comparing the average RSS between neighboring clusters. However, directly implementing 
this procedure would require a large amount of inter-cluster communication. 

To solve this issue, a cluster only communicates its average RSS to the neighboring cluster, next(Cj), 
in the direction of the PU relative to the cluster's centroid. 

Although techniques for estimating the angle of arrival of a transmitter could be beneficial, these 
methods require cluster heads to have multiple antennas which increases complexity of sensors. In order 
to get an approximation of the PU direction, we propose a metric, L(Cj), shown in Line 5 of Algorithm 
Q] We get L(Cj) by subtracting the WCL result of all nodes within the same cluster from the geometric 
centroid of those nodes. This vector approximates the RSS gradient within the cluster. It corresponds to 
an approximate direction of the PU since on average the RSS in (fl]) decreases as distance to PU increases. 
Therefore, the negative of the gradient is directed towards the point with maximum RSS. 

Once the cluster with the highest average RSS is selected, Algorithm |2] is performed. In this step, the 
final WCL calculation is done using nodes within a given radius, R*, of the selected cluster's strongest 
node (SN), Ns- R* is adaptively chosen to be the minimum of the distance to the closest map edge, 
edge(A^s), and the cluster radius Rq. This reduces the border effect, which is the tendency of the location 
estimate to be biased towards the center when the PU is close to the map border, by ensuring a radially 
symmetric node distribution even when the PU is close to the network edge. 

In order to improve the localization accuracy, additional nodes from adjacent clusters are also included 
in the WCL calculation. Using the location of Ns and R*, these adjacent clusters can reduce the amount 
of information exchanged by only transmitting the location and RSS measurements of nodes that fall 
within this circle. In the case of large clusters, each cluster could also choose to calculate a weighted 
centroid using its own nodes and transmit the result of this calculation only. Since WCL is essentially 
an averaging algorithm, this technique allows even Algorithm [2] to be distributed. 

The power efficiency and computational complexity of this distributed approach could further be 
improved by taking into account a practical mobility model of the PU. There has been a lot of work such 
as |27l , which discussed how to incorporate past observations of the target position and a mobility model 
to improve localization. However, applying these methods introduces complexity to WCL and defeats 
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Algorithm 1 Head Cluster Selection 



l: for all a G C a 



2: 
3: 
4: 
5: 
6: 
7: 
8: 
9: 
10: 



P(Ci) <- 
L c (Ci) < 

t(Ci) <r 

next(Cj 



do 

average (P(A r J )) 
- average (L(Nj)) 

_ E w , £ c, [PW)-min(PW ))]L(JV J - ) 
Ei, jec ,(P(^)-min(P(iV,)) 
L c (Ci)-L m (Ci) 
HMCO-L^COH 



°L(d 



if P(Ci) > P(next(Cj)) then 

if P(Ci) > P((7,) V(7,- G adj(a 
Select Ci — > C wc ; 
Proceed to Algorithm |2] 



then 



> cluster centroid 
> cluster WCL 
> direction of gradient 



> cluster for final WCL 



Algorithm 2 Modified WCL 

> SN acts as center of WCL 
> border correction 

> add nodes from adjacent clusters 



l: function PollCluster(C t ,iV s ,i?*) 

2: return C* = {Nj\Nj G C L , ||L (N s ) - L (Nj)\\ < R*} 
3: end function 



1: N s <- argmax iVieCW; P(iVj) 

2: iT <- min(edge(iVs),ifc) 

3: for all Cj G adj(C wc /) do 

4: C* cI <= PollCluster (Cj ,N S ,R*) 

E Wj£C , ; [P(iV 3 0-min(P(iV j ))]L(JV 3 ) 
5: Les * * U„J e C c , |P(7V 3 )-min(P(7V,)j 



the purpose of such a low complexity scheme. 

Intuitively, assuming the PU continuously transmits during successive WCL estimations, a PU would 
most likely be in the same cluster as it was in the previous WCL calculation, or it could move to an 
adjacent cluster. By considering this fact, Algorithm Q] can further limit the clusters involved in choosing 
the head cluster, which is the most power consuming phase, by only taking the previous cluster head and 
its adjacent clusters as C ac tive- 
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B. Analysis of Communication Overhead 

We analyze the total transmit power and computational complexity of CWCL and the proposed DWCL 
algorithm. In order to do so, we first need to investigate the transmit power on a single link of length 
d. The minimum transmit power is obtained from (Q]) as Pt,rom = -fr,mmd 7 10~ s / 10 , where Pt jin in is the 
minimum received power to correctly receive the message. For the computation complexity estimates, we 
assume the number of operations (OPS) for addition, substraction and comparison is 1; for multiplication 
and division is 10. 

1) Centralized WCL: In CWCL, the total number of messages is the sum of reporting messages 
from nodes to the fusion center, thus, N. For the transmit power consumption, we assume the fusion 
center is roughly at the center of a circle area of radius R, and nodes are placed according to a uniform 
distribution. In this case, one can verify that, di, the distance from the ith node to the fusion center 



satisfies di ~ \/M(0, R 2 ), where U(a,b) is the uniform distribution from a to b. Thus the average total 
power consumption is expressed as 

N 

E[P t , c ] = E[P r , min ^(dJlO- s </ 10 )] « NPr^ndjMW-^/ 10 }, (20) 
i=l 

where we use statistical average to replace sum of realizations. 

Note that E[dJ] = E[(d^) 7//2 ], which is the 7/2th moment of an uniform variable. In order to calculate 
E[10~ Si / 10 ], we need the distribution of 10 _Si//1 °, which is a function of central Gaussian variable Sj. 
Thus we can derive the desired pdf as |[25l 

P X = i -in AT" eXP " 50 2~ ■ ( 21 ) 

In 10V2ttct s x \ o% J 
Finally, the average total power consumption is given by 

POO 

E[P t>c ]=NP rMn K[d]] xp(x)dx. (22) 

J o 

From the definition of WCL estimate, number of OPS of CWCL is £>(25iV). 

2) Distributed Algorithm: Assume that: M is the average number of nodes in one cluster; L = N/M 
is the average number of clusters; K is the number of neighboring clusters for each cluster; rj is the 
average percentage of clusters who decide they are the head cluster in step 7 of Algorithm 1 (clusters 
that have to perform step 8). 

The total number of messages for Algorithm 1 is 0(ML + 2L + 2rjKL). To calculate transmit power, 
in step 1) we use the same procedure as CWCL (with a smaller radius); step 2) and 3) are communication 
among cluster heads, we use the average distance between centers of two clusters as the estimate of the 
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average distance among cluster heads. In Algorithm 2, the total number of messages is 0{2K) and the 
calculation of power consumption for this process is the same as step 2) of Algorithm 1. 

In terms of computational complexity, the number of OPS for Algorithm 1 is 0(27N + 4AL + 64KL + 
rjKL) while the number of OPS for Algorithm 2 is 0(3AKM + 26M). 

V. Evaluation of WCL Performance 

In this section, we evaluate the performance of CWCL algorithm and its proposed distributed imple- 
mentation. For CWCL algorithm, using the theoretical framework presented in Section [TTlJ we consider 
impact of sensor node positioning error, different levels of shadowing a s and different correlation distances 
Xq. The effect of node placement is investigated under random grid and uniform distribution. 

A. Performance Metric 

Prior work on WCL uses mean localization error as the performance metric of interest. In fT3ll . a 
slightly modified performance metric where the mean error is normalized by the transmission range R of 
PU is adopted. Simulations in these prior work have shown that the accuracy of WCL improves as the 
density of nodes is increased. This result is intuitive since the node spacing decreases as the density of 
nodes is increased. In this work, we introduce a novel performance metric where the mean localization 
error m eL is normalized with respect to the average node spacing D. The motivation for choosing this 
metric is to eliminate the effect of geometry scaling which intuitively improves accuracy. 

B. Performance of CWCL Algorithms 

1) Impact of Shadowing: In order to isolate the effect of shadowing on WCL performance we consider 
a fixed grid placement of nodes with perfect positioning (i.e. oi = 0) and fixed PU position. The 
normalized mean localization error for this scenario is shown in Fig. [la] For given shadowing variance, 
as the number of nodes increases, there is a slight improvement in the accuracy, which is a consequence of 
averaged shadowing over many nodes. Note that this gain saturates at approximately 200 nodes or more. 
We also observe that WCL is quite robust to shadowing, since for an additional 7.5 dB of shadowing from 
2.5 dB to 10 dB, normalized error increases only 5%. Therefore WCL is well suited for PU localization 
in severely shadowed environments. 

2 ) Impact of Correlation: The results for mean localization error under correlated shadowing are shown 
in Fig. [Tbl assuming fixed value of a s = 4dB, and different levels of Xq normalized to D are presented. 
In highly correlated shadowing such as Xq > 5D, increasing the number of nodes always worsens the 
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Fig. 1. WCL analysis and simulation results for the normalized mean localization error in |(a)| uncorrelated and |(b)| correlated 
shadowing environment with varying levels of shadowing, a 3 . For this scenario, nodes have no positioning error (ai — 0m), 
and are placed on a grid with equal spacing (fixed grid placement) in an area with R = 100m. PU is at the center of the area. 



error. For the high node density case, e.g. 400 nodes, the average error is as high as 47% of D. We 
conclude that when all nodes are included in the WCL calculation, correlation has a degrading effect to 
accuracy. Further investigation shows that the correlation distance resulting in the worst performance is 
proportional to the number of nodes involved. 

3) Impact of Node Positioning Error: The impact of positioning accuracy of sensing nodes is presented 
in Fig.|2]with a fixed grid placement of nodes in an i.i.d. shadowing environment with a s = 5dB. Analysis 
shows that errors in sensor node positioning decrease the WCL accuracy but the impact is marginal. For 
example, o\ of 7m increases the error for about 1%. 

4 ) Impact of Node Placement: The previous subsections considered fixed node placement and PU 
location. Realistically, node placement can be random and the PU also appears at random locations 
relative to the sensor nodes. Next we consider two cases of random node placement: random grid and 
uniformly distributed placement. 

In the random grid scenario, the sensor nodes are still on a grid, but the PU can appear uniformly 
within the center box. A grid distribution of nodes can be practically implemented if nodes are explicitly 
deployed for the purpose of acting as a dedicated localization sensor network. Evenly placed beacons 
have been shown to provide better accuracy for WCL in WSNs fl6l . Results for random grid placement 
in Fig. [3] show approximately 10% performance loss compared to fixed PU location scenarios. 
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Fig. 2. WCL analysis and simulation results for the normalized mean localization error in an uncorrelated shadowing environment 
(er s = 5dB) as a function of variance in the participating nodes' positioning error, a;. For this scenario, nodes are placed on a 
grid with equal spacing (fixed grid placement) in an area with R = 100m. PU is at the center of the area. 



The uniformly distributed placement has been used in ifTTTl — lTT3Tl to more accurately model scenarios 
where the sensor node positions are not predefined, such as in mobile ad-hoc networks. The results for 
uniformly distributed placement are also shown in Fig. [3j It is observed that randomness in the node 
placement can increase the error as much as three times compared to the fixed node placement. 

5) Impact of Degree-of -Irregularity: The results given so far assumed an ideal circular coverage area 
for the PU. Works such as H, (5J indicated that in practical scenarios, the circular model cannot be 
guarranteed due to several non-idealities such as fading, shadowing, and interference in the system. 
Intuitively, the Degree-of-Irregularity {DOT) parameter determines how closely the coverage area can be 
approximated by a circle. DOI = means the transmission range is perfectly circular. The model used 
in our simulation is the DOI model briefly described in @ where the transmission range is a correlated 
Gaussian r.v. centered at R = 100m with variance a = R(DOI). 

In Fig. |4] the effect of DOI is investigated in the presence of uncorrelated medium level shadowing 
(o~ s = 4dB). The result shows that DOI between 0% and 40% increases the error by 4%. DOI is not 
a significant impairment for WCL because nodes closest to the PU, where the effect of irregularity is 
negligible, are weighted more than nodes closer to the edge of the coverage area. However, a slight 
increase in the normalized mean error can be observed as more nodes are used at high DOI (30% or 
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Fig. 3. WCL analysis and simulation results for the normalized mean localization error in a correlated shadowing environment 
(<r s = 4dB) with varying correlation distance normalized to the node spacing. Results of two placements, random grid (RG) 
and uniform random (UR), are shown in an area with R = 100m. For both placements, cr; = 0m. 



40%). The denser the node distribution, the more nodes are included in WCL, but the more nodes there 
are, the more deviation from a perfect circular coverage coverage area, leading to lower accuracy. 

6) Variable Participation: From Fig. [3] we observe that in correlated shadowing increasing the number 
of nodes also increases the error. One solution to this problem is to reduce the number of participating 
nodes used in WCL for this scenario. This could be achieved by ignoring nodes farther away from the 
PU to reduce the number of nodes resulting in less degradation. 

In Fig. |5]we investigate varying levels of node participation for a fixed average node spacing, D. In this 
case the total number of nodes is 100, and the density remains the same while the number of involving 
nodes is varied. To select which nodes are included in the WCL calculation, nodes are sorted according 
to their RSS values. The nodes with high received power are included in WCL. As a result, increasing 
the participation for the uncorrelated case always improves the accuracy. Thus, 100% participation is 
optimal. However, even with a little correlation, this number is greatly reduced and about 10% of the 
total number of nodes is optimal. Another selection scheme was previously proposed in ll28l where only 
nodes whose RSS is within 15% of the stronges node's RSS are used. 

7) Comparison With Other Localization Techniques: The localization accuracy of WCL is also com- 
pared to other techniques suitable for CR scenarios. This includes lateration, centroid and strongest node 
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Fig. 4. Simulation results for the normalized mean error in an uncorrelated shadowing environment with a a = 4dB with 
varying Degree-of-Irregularity (DOI). For this scenario, nodes are uniformly and independently distributed (uniform random 
placement) in an area with average R — 100 m. 




Fig. 5. Effect of varying the number of participating nodes to WCL accuracy, with different correlation distances (Xc) ■ In 
this scenario, N = 100 uniformly and independently distributed nodes (uniform random placement) are in range of the PU, 
while the percentage of these N nodes that are included in the WCL calculation is varied (R = 100 m, cr s = 4dB,<7i = m). 
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Fig. 6. Comparison of WCL performance with a range-based scheme (Lateration) and a range-free scheme (Centroid) 
under different shadowing variance, a s . For this scenario, nodes are uniformly and independently distributed (uniform random 
placement) in an area with R — 100m (uncorrelated shadowing). Note that the Centroid technique is unaffected by a s . 



(SN). Lateration is a range-based technique that uses least-squares estimation using distance estimates 
to the target. The centroid algorithm, on the other hand, is a range-free technique which calculates the 
average of all node positions in range of the PU. Finally, the SN technique simply selects the node with 
the highest RSS and uses its location as an estimate of the PU position. 

Results in Fig. [6] show that WCL performs better under higher a s compared to lateration. Although 
this seems counter-intuitive at first, the result can be explained by taking note that range-based techniques 
require an accurate estimate of the distance to the target to provide accurate results. In the case of non- 
interactive localization, where the only available information is the RSS from the PU, shadowing induces 
a very large error in the range estimates. Under low shadowing (< 2.5 dB), the lateration method is 
more accurate than WCL. At dB shadowing (not shown in the figure), the range-based technique will 
achieve perfect localization. This result supports the claim that WCL is a viable method in the context 
of non-interactive localization because of its robustness. The centroid algorithm performs slightly worse 
than WCL because it does not utilize RSS information. However, the performance difference becomes 
smaller at higher level of shadowing. 

C. Performance of DWCL algorithms 
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Fig. 7. Performance comparison of centralized/distributed WCL (CWCL/DWCL) and strongest node (SN) algorithms with 
varying a s . In this scenario, N = 1000 uniformly and independently distributed nodes (uniform random placement) in a 
2000 m x 2000 m square area. An average cluster radius of Rc = 200 m and the PU is placed randomly in the entire area with 
uniform probability (Xc = m, ai =0 m). 

1 ) Comparison of Accuracy: Fig. |7] presents simulation results that compare normalized mean error 
of CWCL, SN and DWCL under different shadowing environments. SN is included in the comparison 
because it forms the basis of the head cluster selection for DWCL so it acts as an upperbound to the 
error. We observe that the proposed DWCL performance is between CWCL and SN algorithms. It is 
also interesting to note that the accuracy of DWCL degrades relative to CWCL and approaches the 
performance of SN when shadowing is high. The reason behind this is that for cluster head selection, 
DWCL is dependent on the SN algorithm. The accuracy of the SN degrades at large shadowing shown 
in Fig. [6] 

2) Comparison of Transmit Power: Fig. [8] compares the transmit power consumption for CWCL and 
DWCL with different number of clusters. In the simulation we set P r: min = — 70dBm which is the mean 
received power at the border of the area of R = 100 m, assuming Pq = dBm, do = 1 m and 7 = 3.8 in 
CD). We observe that DWCL with 16 clusters consumes about 15dBm less power for each node compared 
with CWCL. The analysis also shows that transmit power of each node is reduced as the number of node 
increases, since the average communication distance is getting smaller in DWCL. 

3) Comparison of Number of OPS: Fig. [9] compares computational complexity estimated in terms 
of number of OPS. It shows that distributed algorithms require about three times more operations than 
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Fig. 8. Analysis and simulation results for average transmit power per node for CWCL and DWCL algorithms with varying 
levels of shadowing with fixed grid placement (7? = 100 m, ai = m). 
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CWCL, however total transmit power is reduced. 
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VI. Conclusion 

In this paper we present the first analytical framework for the performance of WCL for localization 
of PU transmitters in CR networks. We derived the closed-form expression for the error distribution of 
WCL parameterized by node density, node placement, shadowing variance, correlation distance and errors 
in sensor nodes positions. Using the theoretical results in conjunction with numerical simulations, the 
robustness of WCL against various physical conditions is investigated and quantified. Based on the above 
analysis, we provide guidelines in terms of number of nodes and node placement required to achieve 
certain localization accuracy using WCL and propose variable node participation for improved WCL 
performance. To facilitate the deployment of WCL in an energy efficient context, a distributed cluster- 
based implementation of WCL is also proposed, which does not assume a fusion center that collects all 
RSS information from nodes. The distributed method achieves comparable accuracy with its centralized 
counterpart, and greatly reduces total power consumption. 

Appendix 

Correlation Coefficient Between Localization Errors In X And Y Dimensions 

We derive the correlation coefficient between localization errors in x and y dimensions, Px pVp , which 
is given by 

Px p y p <Jx p <?y p = E[x p y p ] - m Xp m yp . (23) 

For notational convenience we define qi = Pj — P mm ~ A/"(/tj , a 2 s ) and form the vector q = 
[<7i,<72, • • • ,1m] T , so that q ~ J\f(fi,fl) with ily = of e~ll Li ~ L 3'll/ j5f < ! . Note that this definition includes 
both i.i.d and correlated shadowing case. The first term in (l23l then becomes 



E[x p y p ] = E 



T T 
x J qq 1 y 



(24) 



l T qq T l 

Define three new variables t\ = x T q, £2 = y T q and £3 = l T q, and their vector form t = [£i, £2, *3] T - 
Now the first term in ([23]) becomes E[(£i£ 2 )/£§]. As shown in Sec. II, x ~ A/"(x, of I), and y ~ Af{y, of I). 
£1 can be approximated as a single Gaussian variable as shown in Sec. IIII-A1I since it is the same as a 
in ((6]). We denote its distribution as t\ ~ N(mi,af), similarly £2 ~ N(rri2, a%). The distribution of £3 
is given as £3 ~ J\f(ms, cr|), where 7713 = l T /x and cr| = l T f21. 

As a result t ~ M(t, ft t ), where t = [mi, m,2, m^] T is the mean vector and f2 t is the covariance matrix. 
To derive f2 t we find Cov(£i,£2) as 

N N 

Cov(£i,£ 2 ) = E[£i£ 2 ] - mim 2 = ^[{/, VjQj)] ~ w>im 2 = x T f2y - mim 2 (25) 

i=i j=i 
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where in the last the equation we used the fact E^g^g.,] = Xii/jQij. Similarly we calculate all entries 
of ft t . 

In order to simplify the calculation of (|24b . we perform the following transformation to t, t = l+fl t 
thus t ~ JV(0, 1). Clearly ti = rrii + ojfi, i = 1,2,3, where is the ith column of fi t T l 2 _ Then we 
have 



"3 (?"3 + U%t) 2 

Since the expectation in (1261 involves multiplication and division of three correlated random variables, 
the calculation can not be done directly. In order to reduce the number of variables involved, first we 
form A = [w3,wi,a)2l, then perform QR factorization to reduce dimensionality, obtaining Q T A = R, 
in which Q is orthogonal and R is upper triangular 



R 



(27) 



ru ri2 r 13 
r 22 r 23 
r 33 j 

From the orthogonal-invariant property of standard Gaussian variables 1251 . the distribution of Q^t is 
the same as that of t, which is denoted as Q t = t. 

Note that since cj^QQ^t = oS^t, denominator of d26l can further be expressed as 



(m 3 + u^QQ 7 !) 2 = (ms + ojjQiY = (m 3 + [r u ,0,0] tY = (m 3 + r u h) 



(28) 



where we use the fact that Q T o^ 3 is the first column of R; and t = [ti,t2,t 3 ] T . Similarly, numerator of 
(1261) can also be further expressed as 

(mi + u?ft)(m 2 + wp) = (mi + n 2 ti + ^22*2) [m 2 + n 3 ti + r 23 i2 + ^33^3) (29) 
Therefore, the expectation in (1241) becomes 

11 ^ V, ^12^13 ^ 



E[x p y p ] 



H2H3 



11 



72 1 / mi 1 m 2 



(30) 



After factorization (1301 becomes 
/ 



E[x p y p ] 



H2H3 



11 



1 + 
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mi 1 1% 
ri2 r i3 



'»3 
''11 



+ 



m 1 m 2 +r 2 2r23 
ri 2 r 13 
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mi _|_ rri2 \ rri3 



ri2 r 13 I ru 



1+ ( 2J3 



(31) 



Substituting yTj) into (1231 gives the result. 
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